Description

This invention relates to synthetic genes coding for horseradish peroxidase.

Horseradish peroxidase C (E.C.1.11.1.7) (HRP) is the major peroxidase isozyme isolated from the
horseradish (Armoracia rusticana). It is a monomeric glycoprotein of 308 amino acids, the polypeptide chain
having a MW of 33,980 D. There are three neutral carbohydrate side chains and 4 disulphide bridges. The
amino acid sequence of the mature protein has been determined. The presence of a pyrrolidonecarboxylyl
amino terminus indicates that the protein is probably produced as a precursor form that is processed on
secretion. The active form of the enzyme contains a hemin prosthetic group.

The enzyme is particularly stable and is amenable to crosslinking and derivatisation without excessive
loss of activity. This, together with its wide range of chromogenic substrates, some of which give rise to
insoluble, chemiluminescent or fluorescent products, and the low background activities observed in most
applications, has made horseradish peroxidase an invaluable tool for diagnostic and research applications
in the fields of immunology, histochemistry, cytology and molecular biology. A further advantage it presents
over other enzymatic markers is that some substrates for the enzyme give rise to electron dense
products that allow correlation of peroxidase location with cellular ultrastructure using electron microscopy.
In addition, horseradish peroxidase is electron dense itself by virtue of the Fe it contains and as a result can
act as an E.M. marker in its own right. Particular applications have been found in immunochemistry, where
peroxidase cross linked to immunoglobulin is widely used in both ELISA based assay systems and
immunocytochemistry. Methods have been described that use either direct crosslinking of peroxidase to the
immunoglobulin or indirect crosslinking of biotin labelled immunoglobulin to a streptavidin/horseradish
peroxidase complex. Such streptavidin complexes have also found widespread application in nucleic acid
hybridisation methods where biotinylated probe sequences can be localised by sequential incubation with
the streptavidin/peroxidase complex and a suitable chromogenic peroxidase substrate.

The amino acid sequence of horseradish peroxidase is taught by Welinder, K.G. (Eur. J. Biochem. 98,
483-502 (1979)). The cloning of the cDNA or natural gene for horseradish peroxidase has not been
described.

In order to facilitate the dissection of the structure/function relationships of HRP, its incorporation into
expression vectors and the production of novel chimeric proteins containing HRP functionality, an improved
novel synthetic gene for the peroxidase C produced by Armoracia rusticana is sought.

It is by no means easy to predict the design of an improved HRP gene, since the factors that determine
the expressibility of a given DNA sequence are still poorly understood. Furthermore, the utility of the gene
in various applications will be influenced by such considerations as codon usage and restriction sites. The
present invention relates to a synthetic HRP gene which has advantages in the ease with which it can be
modified due to the presence of useful restriction sites.

When synthesising and assembling genes, problems have been encountered when there are inverted or
direct repeats greater than eight bases long in the genetic sequence. In addition, areas of unbalanced base
composition such as G/C or A/T rich regions or polypurine/polypyrimidine tracts have been found to lead to
inefficient expression. The present invention seeks to overcome or at least alleviate these difficulties.
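
The two design constraints just described (no direct or inverted repeats longer than eight bases, and no strongly unbalanced base composition) lend themselves to a simple computational check. The following Python sketch is illustrative only and is not part of the disclosed method; the function names, the 9-base word length and the 20-base window with 25%/75% G/C thresholds are assumptions chosen for the example, and polypurine/polypyrimidine tracts are not examined.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def find_repeats(seq, k=9):
    """Find k-base words that recur directly or as inverted repeats.

    k=9 is the shortest word that is 'greater than eight bases long'.
    Returns (direct, inverted) as sorted lists of words.
    """
    positions = {}
    for i in range(len(seq) - k + 1):
        positions.setdefault(seq[i:i + k], []).append(i)
    direct = sorted(word for word, hits in positions.items() if len(hits) > 1)
    inverted = sorted(
        word for word in positions
        # report each inverted pair once, under its alphabetically first member
        if reverse_complement(word) in positions
        and word <= reverse_complement(word)
    )
    return direct, inverted


def gc_fraction(seq):
    """Fraction of G and C bases in seq."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def unbalanced_windows(seq, window=20, low=0.25, high=0.75):
    """Start positions of windows whose G/C fraction lies outside [low, high]."""
    return [
        i for i in range(len(seq) - window + 1)
        if not low <= gc_fraction(seq[i:i + window]) <= high
    ]
```

A candidate gene sequence for which `find_repeats` returns two empty lists and `unbalanced_windows` returns an empty list would satisfy these assumed thresholds; the thresholds themselves would need to be tuned to the application.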

According to a first aspect of the invention, there is provided DNA coding for horseradish peroxidase
and including the following sequence:

The above sequence may be immediately preceded by an initiation codon (ATG) and immediately followed
by a termination codon (TAA), but this will not necessarily be the case if the DNA incorporates linker(s)
and/or extension(s), such as a sequence coding for a signal peptide, for example for efficient expression in
eukaryotic cells such as mammalian cells. One extension which gives good expression in mammalian cells
is a 5'-extension coding for the amino acids KCSWVIFFLMAWTGVNS, which may be provided between an
initiation codon and the codon coding for the first Q residue. A preferred such extension is shown in Figure
6. A sequence coding for a 3'-signal sequence may code for LLHDMVEVVDFVSSM; a preferred DNA
sequence coding for this series of amino acid residues is also shown in Figure 6. Other useful 5' extensions
are AAGCTTAACCATG and AAGCTTCATATG and a useful 3' extension is TAATAAGGATCCGAATTC.

A synthetic HRP gene as described above incorporates useful restriction sites at frequent intervals to
facilitate the cassette mutagenesis of selected regions. Also included in preferred embodiments are flanking
restriction sites to simplify the incorporation of the gene into any desired expression system.
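
As an illustration of how such restriction sites can be located in a sequence, the following Python sketch scans a DNA string for the recognition sequences of the enzymes named in this specification. It is a minimal sketch only: the `SITES` table and the `find_sites` helper are names introduced for this example, and the table lists only the standard recognition sequences of those six enzymes.

```python
# Standard recognition sequences of the enzymes named in this specification.
SITES = {
    "HindIII": "AAGCTT",
    "NdeI": "CATATG",
    "HpaI": "GTTAAC",
    "XhoI": "CTCGAG",
    "BamHI": "GGATCC",
    "EcoRI": "GAATTC",
}


def find_sites(seq, sites=SITES):
    """Map each enzyme to the 0-based start positions of its site in seq."""
    hits = {}
    for name, site in sites.items():
        start = seq.find(site)
        while start != -1:
            hits.setdefault(name, []).append(start)
            start = seq.find(site, start + 1)   # allow overlapping matches
    return hits


# The 5' linker of Example 2 and the 3' extension quoted above.
five_prime = find_sites("AAGCTTCATATGCAGTTAACC")
three_prime = find_sites("TAATAAGGATCCGAATTC")
```

Enzymes whose site is absent are simply omitted from the returned dictionary, so an empty result means none of the listed sites occurs.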

Codons are those that are favoured by E. coli, but it is expected that the DNA would be suitable for
expression in other organisms including yeast and mammalian cells.

According to a second aspect of the invention, there is provided a genetic construct comprising DNA
according to the first aspect. A genetic construct in accordance with the second aspect may be a vector,
such as a plasmid, cosmid or phage.

According to a third aspect of the invention, there is provided a process for the preparation of DNA in
accordance with the first aspect or a genetic construct in accordance with the second aspect, the process
comprising coupling successive nucleotides and/or ligating appropriate oligomers.

The invention also relates to other nucleic acid (including RNA) either corresponding to or complementary
to DNA in accordance with the first aspect.

The invention encompasses a process for the production of monodisperse horseradish peroxidase C
comprising the expression of a genetic construct as described above.

Further, the invention extends to constructs as described above comprising a sequence in accordance
with the first aspect fused to any other sequence of DNA so as to result in a sequence capable of encoding
a hybrid protein possessing peroxidase activity. An example of such a construct is a genetic fusion between
a gene encoding horseradish peroxidase and a gene encoding streptavidin or avidin such that the encoded
fusion protein possesses both biotin binding and peroxidase activity. Another example is a genetic fusion
between a gene encoding horseradish peroxidase and a gene encoding an immunoglobulin-derived antigen






EP 0 382 762 B1 



binding function such that the fusion protein possesses both antigen binding and horseradish peroxidase
activity. The antigen binding function may be an immunoglobulin heavy chain or light chain or fragments
thereof or an engineered monomeric antigenic recognition site.

Particular constructs of interest include: vectors comprising the gene for horseradish peroxidase C that
enable the production of fusions between horseradish peroxidase and any other protein of interest; and
expression vectors that provide for the co-expression of the gene for horseradish peroxidase and another
gene of interest either as a single fusion product, as a single polycistronic message or as two separate but
linked transcriptional units.

According to a further aspect of the invention, there is provided a gene for horseradish peroxidase
containing a mutation (either missense, nonsense, deletion, insertion, duplication or other re-arrangement)
that destroys or impairs the activity of the encoded horseradish peroxidase protein. The invention extends
to genetic constructs including all or a fragment of such a mutant horseradish peroxidase gene.

Defective or non-defective horseradish peroxidase genetic constructs can be employed (for example
as markers) in mammalian cells and/or in transgenic animals.

Specific applications of synthetic genes for horseradish peroxidase, which themselves form further
aspects of the invention, are disclosed in greater detail below:

1) The gene can be incorporated into a suitable expression vector to allow for the efficient production of
the enzyme in a compatible organism. This will have the advantage of being a ready source of a
monodisperse enzyme preparation free of the contaminating isozymes present in the material isolated
from horseradish root. Varying the organism or cell type chosen for production will also allow for the
production of HRP with different patterns of glycosylation, including no glycosylation. Such material will
have better defined properties that will make it more suitable for more demanding histochemical
applications and sensitive enzyme assays, especially immunoassays.

2) The gene can be incorporated into an HRP-streptavidin or HRP-avidin gene fusion. This will allow for
the production of streptavidin-HRP or avidin-HRP complexes without the need for cross-linking. Again
this will allow for a better defined, more stable product and will probably result in less loss of both biotin
binding and peroxidase activity.

3) Similarly, fusions between immunoglobulins and HRP or protein A and HRP can be produced that
would be valuable histochemical reagents. Again the need for the usual cross-linking procedures would
be avoided.

4) The HRP gene would have valuable applications in the construction of vectors designed to allow the
production of fusions between HRP and any other protein for which a gene or cDNA had been cloned or
for which the amino acid sequence is known. This would be useful both for monitoring the expression of
a gene the product of which is difficult to assay and to tag the protein of interest to allow its metabolism
and pharmacodynamics to be followed in vivo by the use of the appropriate histochemical techniques or
enzyme assays. Additionally, HRP fusions will allow for a simple immunopurification of the fusion product
through the use of an appropriate anti-HRP antibody.

5) The expression of HRP will be a useful marker in expression systems, e.g. mammalian cell expression
systems. The HRP gene could be expressed either as a fusion or on a polycistronic message with the
gene of interest, or as a separate but closely linked transcriptional unit. The production of the easily
assayed HRP could be readily screened for and used as an indication as to which clones of cells were
likely to be expressing large quantities of the desired product. The use of fluorescent or chemiluminescent
HRP chromogenic substrates would allow for the possibility of directly selecting high producing
eukaryotic cells by fluorescence activated cell-sorting (FACS).

6) HRP genes carrying mutations (missense, nonsense, deletion, insertion, duplication or other
re-arrangement) that destroy or impair the enzymatic activity of the resultant product would allow the
construction of vectors that could be used to follow the frequency of reversion or suppression of the
particular mutation introduced into the gene.

The introduction of such defective HRP genes into the germ line of the organism of interest would
also enable a researcher to fate-map particular cell-types by histologically examining the pattern of HRP
activity in the tissue of interest. Care would have to be exercised in constructing a mutant HRP gene
with the correct in vivo reversion rate so that areas of HRP activity and hence the presence of reverted
HRP gene could be taken as evidence for the clonal origin of the HRP+ cells. The intact synthetic
non-mutant gene could also be used for such fate-mapping experiments by infection of an organism with the
HRP gene in a suitable vector such as a retroviral vector or transposon.

7) A synthetic gene for HRP has the advantage of allowing the production of HRP genes modified to
encode a protein carrying small additional sequences, such as N- or C-terminal extensions. These will be
of great application in simplifying the purification of the HRP and/or increasing the ease and enhancing
the specificity with which it can be cross-linked to other proteins of interest or otherwise derivatised. For
example, a C-terminal extension of six to eight Arg residues could be used to simplify purification by
analogy with the technique of Sassenfeld et al. Bio/technology 2 76 (1984). Alternatively, a tail of Lys
residues would provide an accessible and sensitive site for reaction with bifunctional cross-linking
reagents such as glutaraldehyde.

Preferred embodiments and examples of the invention will now be described. In the following
description, reference is made to a number of drawings, in which:

Figure 1 shows the amino acid sequence of horseradish peroxidase C;
Figure 2 shows the sequence of the horseradish peroxidase synthetic gene; a summary of useful
restriction sites; and a sequence of front and back halves of the gene that were initially
cloned;
Figure 3 shows a sequence of synthetic horseradish peroxidase gene divided into oligonucleotides;
Figure 4 shows a summary of assembly procedure used;
Figure 5 shows the structure of the HRP E. coli expression plasmid pSD18;
Figure 6 shows a synthetic HRP gene modified for efficient expression in mammalian cells; and
Figure 7 shows the structure of the HRP mammalian expression plasmid pCP21.



Example 1

The gene was designed to be synthesized and cloned, in this example, in two halves with a final
sub-cloning step to yield the full length gene. The sequence of the two halves of the gene together with that of
the final product are depicted in Figure 2. The final synthetic gene encodes the entire mature horseradish
peroxidase protein together with the required initiator methionine residue but lacks the leader sequence that
is assumed to be present in the natural gene. It is envisaged that the leader sequence appropriate to the
expression system of choice would be added to the synthetic gene as required or omitted to allow for
intracellular expression of the gene.

The desired gene sequence was divided into a front half and a back half of 501 and 474 bp
respectively. Both halves were designed with a common XhoI site to allow for the complete gene to be
assembled with a simple cloning step. The front and back halves of the gene were divided into 24 and 22
oligodeoxyribonucleotides (oligomers) respectively as depicted in Figure 3. The division was such as to
provide 7 base cohesive ends after annealing complementary pairs of oligomers. The end points of the
oligomers were chosen to minimize the potential for inappropriate ligation of oligomers at the assembly
stage.
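
The principle of dividing a duplex into complementary oligomer pairs with 7 base cohesive ends can be sketched as follows. This is a simplified illustration and not the division actually used (which is shown in Figure 3): it assumes uniform blocks of a hypothetical length of 40 bases, whereas the end points in the example were chosen individually to minimise inappropriate ligation. The names `split_into_oligomers`, `block` and `overhang` are introduced here for the sketch.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def split_into_oligomers(top, block=40, overhang=7):
    """Split a duplex, given as its top strand, into staggered oligomers.

    Top-strand oligomers are consecutive blocks of the sequence.
    Bottom-strand oligomers cover the same blocks shifted by `overhang`
    bases, so each annealed pair leaves an `overhang`-base cohesive end
    that is complementary to the start of the next top-strand oligomer.
    Both lists are written 5' to 3'.
    """
    top_oligos = [top[i:i + block] for i in range(0, len(top), block)]
    bottom_oligos = [
        reverse_complement(top[i + overhang:i + overhang + block])
        for i in range(0, len(top) - overhang, block)
    ]
    return top_oligos, bottom_oligos


# Arbitrary 96-base test sequence (not the HRP gene).
example = "ATGCAGTTAACCCCTACATTCTACGACAATAGCTGTCCCAACGTGTCC" * 2
tops, bottoms = split_into_oligomers(example)
```

In this layout the last 7 bases covered by each bottom-strand oligomer pair with the first 7 bases of the following top-strand oligomer, which is what allows adjacent annealed pairs to be joined by ligase.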

The oligomers were synthesized by automated solid phase phosphoramidite chemistry. Following
de-blocking and removal from the controlled pore glass support the oligomers were purified on denaturing
polyacrylamide gels, further purified by ethanol precipitation and finally dissolved in water prior to
estimation of their concentration.

All the oligomers with the exception of the 5' terminal oligomers BB279 and BB302 for the front half and
BB303 and BB324 for the back half were then kinased to provide them with a 5' phosphate as required for
the ligation step. Complementary oligomers were then annealed and the oligomers ligated together by T4
DNA ligase as depicted in Figure 4. The ligation products were separated on a 2% low gelling temperature
(LGT) gel and the bands corresponding to the front and back halves of the horseradish peroxidase gene
were cut out and extracted from the gel. The purified fragments were then ligated separately to
EcoRI/HindIII cut DNA of the plasmid vector pUC18. The ligated products were transformed into HW87 and
plated on L-agar plates containing 100 mcg ml⁻¹ ampicillin. Colonies containing potential clones were then
grown up in L-broth containing ampicillin at 100 mcg ml⁻¹ and plasmid DNA isolated. Positive clones were
identified by direct dideoxy sequence analysis of the plasmid DNA using the 17 base universal primer, a
reverse sequencing primer complementary to the opposite strand on the other side of the polylinker and
some of the oligomers employed in the assembly of the gene that served as internal primers. One front half
and one back half clone were subsequently re-sequenced on both strands to confirm that no mutations were
present. The complete gene was then assembled by isolating the 466 bp XhoI-EcoRI fragment from the
back half clone that contained the 3' end of the gene and ligating it to a front half clone that had also been
digested with EcoRI and XhoI. The identity of the final construct was confirmed by restriction analysis and
subsequent complete resequencing.

All the techniques of genetic manipulation used in the manufacture of this gene are well known to those
skilled in the art of genetic engineering. A description of most of the techniques can be found in one of the
following laboratory manuals: Molecular Cloning by T. Maniatis, E.F. Fritsch and J. Sambrook published by
Cold Spring Harbor Laboratory, Box 100, New York, USA, or Basic Methods in Molecular Biology by L.G.
Davis, M.D. Dibner and J.F. Battey published by Elsevier Science Publishing Co. Inc., New York, USA.
Additional and modified methodologies are detailed below.

1) Oligonucleotide synthesis

The oligonucleotides were synthesized by automated phosphoramidite chemistry using cyanoethyl
phosphoramidites. The methodology is now widely used and has been described (Beaucage, S.L. and
Caruthers, M.H. Tetrahedron Letters 24, 245 (1981)).

2) Purification of Oligonucleotides

The oligonucleotides were de-protected and removed from the CPG support by incubation in
concentrated NH3. Typically, 50 mg of CPG carrying 1 micromole of oligonucleotide was de-protected by
incubation for 5 hr at 70° in 600 mcl of concentrated NH3. The supernatant was transferred to a fresh tube
and the oligomer precipitated with 3 volumes of ethanol. Following centrifugation the pellet was dried and
resuspended in 1 ml of water. The concentration of crude oligomer was then determined by measuring the
absorbance at 260 nm.

For gel purification 10 absorbance units of the crude oligonucleotide were dried down and resuspended
in 15 mcl of marker dye (90% de-ionised formamide, 10 mM Tris, 10 mM borate, 1 mM EDTA, 0.1%
bromophenol blue). The samples were heated at 90° for 1 minute and then loaded onto a 1.2 mm thick
denaturing polyacrylamide gel with 1.6 mm wide slots. The gel was prepared from a stock of 15%
acrylamide, 0.6% bisacrylamide and 7M urea in 1 X TBE and was polymerised with 0.1% ammonium
persulphate and 0.025% TEMED. The gel was pre-run for 1 hr. The samples were run at 1500 V for 4-5 hr.
The bands were visualized by UV shadowing and those corresponding to the full length product cut out and
transferred to micro-testubes. The oligomers were eluted from the gel slice by soaking in AGEB (0.5 M
ammonium acetate, 0.01 M magnesium acetate and 0.1% SDS) overnight. The AGEB buffer was then
transferred to fresh tubes and the oligomer precipitated with three volumes of ethanol at -70° for 15 min.
The precipitate was collected by centrifugation in an Eppendorf microfuge for 10 min, the pellet washed in
80% ethanol, the purified oligomer dried, redissolved in 1 ml of water and finally filtered through a 0.45
micron micro-filter. The concentration of purified product was measured by determining its absorbance at
260 nm.
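
By way of illustration, an absorbance reading at 260 nm can be converted to an approximate molar concentration, for example to measure out a fixed number of picomoles for a later step. The sketch below is an assumption-laden approximation and not the authors' stated calculation: it takes 1 absorbance unit as roughly 33 mcg/ml of single-stranded DNA and about 330 g/mol per nucleotide, both of which are common rules of thumb rather than figures from this specification, and the function names are hypothetical.

```python
# Assumed conversion factors (common approximations, not taken from the text).
MCG_PER_ML_PER_A260 = 33.0     # ~33 mcg/ml single-stranded DNA per A260 unit
G_PER_MOL_PER_BASE = 330.0     # ~330 g/mol average mass per nucleotide


def oligomer_pmol_per_mcl(a260, length, dilution=1.0):
    """Approximate oligomer concentration in pmol per mcl.

    a260     : absorbance of the measured (possibly diluted) sample
    length   : oligomer length in bases
    dilution : factor by which the stock was diluted before reading
    """
    ng_per_mcl = a260 * dilution * MCG_PER_ML_PER_A260  # mcg/ml equals ng/mcl
    return ng_per_mcl * 1000.0 / (G_PER_MOL_PER_BASE * length)


def volume_for_pmol(target_pmol, a260, length, dilution=1.0):
    """Volume of stock (mcl) that contains target_pmol of oligomer."""
    return target_pmol / oligomer_pmol_per_mcl(a260, length, dilution)
```

Under these assumptions a 40-base oligomer reading 1.0 absorbance units undiluted is about 2.5 pmol per mcl, so 250 pmol would be contained in about 100 mcl.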

3) Kinasing of oligomers

250 pmole of oligomer was dried down and resuspended in 20 mcl kinase buffer (70 mM Tris pH 7.6,
10 mM MgCl2, 1 mM ATP, 0.2 mM spermidine, 0.5 mM dithiothreitol). 10 u of T4 polynucleotide kinase was
added and the mixture incubated at 37° for 30 min. The kinase was then inactivated by heating at 85° for
15 min.

4) Annealing

8 mcl of each oligomer was mixed, heated to 90° and then slow cooled to room temperature over a
period of an hour.

5) Ligation

5 mcl of each annealed pair of oligomers were mixed and 10 X ligase buffer added to give a final ligase
reaction mixture (50 mM Tris pH 7.5, 10 mM MgCl2, 20 mM dithiothreitol, 1 mM ATP). T4 DNA ligase was
added at a rate of 100 u per 50 mcl reaction and ligation carried out at 15° for 4 hr.

6) Agarose gel electrophoresis

Ligation products were separated using 2% low gelling temperature agarose gels in 1 X TBE buffer
(0.094 M Tris pH 8.3, 0.089 M boric acid, 0.25 mM EDTA) containing 0.5 mcg ml⁻¹ ethidium bromide.

7) Isolation of ligation products

The band corresponding to the expected horseradish peroxidase gene or gene fragment ligation
product was identified by reference to size markers under long wave UV illumination. The band was cut out
of the gel and the DNA extracted as follows.

The volume of the gel slice was estimated from its weight and then melted by incubation at 65° for 10
min. The volume of the slice was then made up to 400 mcl with TE (10 mM Tris pH 8.0, 1 mM EDTA) and
Na acetate added to a final concentration of 0.3 M. 10 mcg of yeast tRNA was also added as a carrier. The
DNA was then subjected to three rounds of extraction with equal volumes of TE equilibrated phenol followed
by three extractions with ether that had been saturated with water. The DNA was precipitated with 2
volumes of ethanol, centrifuged for 10 min in a microfuge, the pellet washed in 70% ethanol and finally
dried down. The DNA was taken up in 20 mcl of TE and 2 mcl run on a 2% agarose gel to estimate the
recovery of DNA.

8) Cloning of fragments

For the initial cloning of the two halves of horseradish peroxidase 0.5 mcg of pUC18 DNA was prepared
by cleavage with HindIII and EcoRI as advised by the suppliers. The digested DNA was run on an 0.8%
LGT gel and the vector band purified as described above. For the final assembly step the clone carrying the
front half of the horseradish peroxidase gene was treated similarly using the enzymes XhoI and EcoRI.

20 ng of cut vector DNA was then ligated to various quantities of peroxidase gene DNA ranging from 2 to 20 ng for 4
hr using the ligation buffer described above. The ligation products were used to transform competent HW87
as has been described. Ampicillin resistant transformants were selected on L-agar plates containing 100
mcg ml⁻¹ ampicillin.

9) Isolation of plasmid DNA

Plasmid DNA was prepared from the colonies containing potential horseradish peroxidase clones
essentially as described (Ish-Horowicz, D., Burke, J.F. Nucleic Acids Research 9 2989-2998 (1981)).

10) Dideoxy sequencing

The protocol used was essentially as has been described (Biggin, M.D., Gibson, T.J., Hong, G.F.
P.N.A.S. 80 3963-3965 (1983)). The method was modified to allow sequencing on plasmid DNA as
described (Guo, L-H., Wu, R. Nucleic Acids Research 11 5521-5540 (1983)).

11) Transformation

Transformation was accomplished using standard procedures. The strain used as a recipient in the
cloning was HW87 which has the following genotype:

araD139(ara-leu)del7697 (lacIPOZY)del74 galU galK hsdR rpsL srl recA56

Any other standard cloning recipient such as HB101 would be adequate.

Example 2

The front end of the synthetic HRP gene prepared in Example 1 was modified by the replacement of
the HindIII-HpaI fragment with a synthetic linker carrying an NdeI site on the initiator ATG as follows:

          M  Q  L  T  . .
AAGCTTCATATGCAGTTAACC . .
HindIII NdeI  HpaI

Example 3

Expression of the Synthetic Horseradish Peroxidase Gene in Escherichia coli

The synthetic HRP gene of Example 2 was cloned into the expression vector pGC517 on a NdeI-BamHI
fragment to give the plasmid pSD18. The host vector pGC517 was prepared from the known plasmid
pAT153 (Twigg & Sherratt Nature 283, 216-218 (1980)), which is now a standard E. coli high expression
vector, by the incorporation by standard methods of the known tac promoter sequence and a termination
sequence. pAT153 is itself a derivative of pBR322. In pGC517 the HRP gene is expressed from the
powerful and regulatable tac promoter. To ensure that expression remained repressed in uninduced cultures
the plasmid was maintained in E. coli strain W3110 lacIq, which is widely available, in which the lac
repressor protein is over-produced. Figure 5 depicts the structure of pSD18.

Strain W3110 lacIq-pSD18 was grown in M9 minimal medium containing 0.2% glucose and 0.2%
casamino acids. At an O.D. of 0.2 - 0.3 the culture was induced by the addition of IPTG to a final
concentration of 5mM. The culture was grown for a further 3 hr with samples removed at 30 min intervals.

Microscopic examination of the induced culture revealed the presence of inclusion bodies, characteristic
of the accumulation of large amounts of insoluble aggregated protein within the cell. In addition, cultures
expressing HRP at high levels acquired a pink colouration, perhaps related to the overexpression of a haem
protein. SDS/PAGE analysis subsequently revealed the presence of a large amount of a 33 kD protein,
estimated at 10-20% of total cell protein in induced but not uninduced cultures. Western blot analysis
confirmed that this protein was HRP.

Standard methods for inclusion body isolation could be applied to obtain a substantial purification of the
denatured HRP as insoluble aggregates. This material was then dissolved in 6 M guanidine HCl prior to
renaturation. For renaturation, the dissolved HRP was dialysed against 8 M urea, 50mM Tris HCl, 100mM
NaCl for 24 hr. Ca2+ was then added (as CaCl2) to 1 mM and the sample incubated for 2 hr at room
temperature. This procedure resulted in the recovery of about 0.125% of the expected HRP activity by the
standard pyrogallol colorimetric assay and based on the protein concentration and estimated purity of the
preparation (see Table 1).

Table 1 - Renaturation of HRP Expressed in E. coli

Sample  Conditions               Rate of reaction    Amount of           Activity              Activity
                                 (maximum)           recombinant HRP C   AU/min mcg rec. HRP   (% of max. activity
                                 AU/min              assayed, mcg                              of commercial HRP)

1       before 1st dialysis      0.01 AU/0.8 min     25 mcg              5x10^-4 AU/min mcg    0.007%
2       after 1st dialysis       0.015 AU/1.1 min    5.77 mcg            0.0024 AU/min mcg     0.034%
3       sample 2 incubated with  0.01 AU/1.5 min     0.76 mcg            0.029 AU/min mcg      0.125%
        1 mM Ca2+ for 2h

Control samples prepared from similar cultures carrying the expression plasmid without the HRP gene
gave backgrounds about 1000 fold less than this. The assay mixture contained freshly prepared pyrogallol
and peroxide in the following concentrations: 11mM K phosphate, pH 6.0, 8mM H2O2, 0.55% w/v pyrogallol
in H2O. The HRP was added and the increase in absorption at 420nm was followed.

Thus the synthetic HRP gene is capable of high level expression in E. coli and is capable of directing the
synthesis of active product.
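
The activity figures in Table 1 follow from dividing the observed rate of absorbance change by the amount of recombinant HRP assayed. The short Python sketch below reproduces that arithmetic for samples 1 and 2, using the values as they are read from the table; the function and variable names are introduced here for illustration only.

```python
def specific_activity(delta_au, minutes, mcg_hrp):
    """Activity in AU per minute per mcg of recombinant HRP."""
    return delta_au / minutes / mcg_hrp


# (absorbance change in AU, time in min, mcg of HRP assayed), from Table 1
samples = {
    1: (0.01, 0.8, 25.0),    # before 1st dialysis
    2: (0.015, 1.1, 5.77),   # after 1st dialysis
}

activities = {n: specific_activity(*values) for n, values in samples.items()}

# Ratio of specific activity after and before the first dialysis.
fold_improvement = activities[2] / activities[1]
```

On these figures the first dialysis raises the specific activity a little under five fold, which matches the ratio of the percentage activities quoted for the same two samples (0.034% against 0.007%).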

Example 4

The synthetic HRP gene of Example 2 was modified as follows to allow for its efficient expression in
mammalian cells:

(a) The 3' end of the gene was extended from the PstI site to include the C-terminal extension reported
by Fujiyama et al. Eur. J. Biochem. 173, 681-687 (1988).

(b) The 5' end of the gene was modified by the addition of a HindIII/HpaI linker which encoded a signal
sequence based on an immunoglobulin signal peptide.

The modified HRP gene is depicted in Figure 6, and will be referred to as HRPX.

Example 5

Expression of the Synthetic Horseradish Peroxidase Gene in Mammalian Cells

The HRPX gene of Example 4 was inserted into the mammalian cell expression vector pCPHU to give
pCP21, in which the HRP gene is expressed from the HCMV (Human Cytomegalovirus) early promoter, see
Figure 7. The plasmid pCPHU is based on pUC18, which is widely available and from which it can be
prepared by standard methods, using the information in Figure 7.

The HRP expression plasmid pCP21 was transfected into COS cells using the standard technique of
calcium phosphate precipitation (20mcg DNA transfected per 10^ cells). HRP activity was assayed in cell
culture medium, 48-72h post transfection using tetra-methyl benzidine substrate (TMB), a standard HRP
reagent. No HRP activity was detectable in control constructs which did not contain a signal sequence
and/or the 3' extension. In contrast, HRP activity was clearly detectable in cells transfected with pCP21 (up
to 10x greater than in controls). The results are shown in Table 2.

Table 2 - HRP Expression in COS Cells

Vol. Extract    O.D.450, by plasmid
(mcl)           pCP21     pCP22     pCP11     pCP12

100             .004      .022      .008      .003
50              .033      .012      .010      .013
25              .107*     .010      .003      .016
10              .084*     .011      .007      .017
5               .064*     .015      .012      .011
1               .028      .008      .007      .007

KEY

pCP21  HRP with N and C terminal signals, correct orientation.
pCP22  HRP with N and C terminal signals, wrong orientation.
pCP11  HRP with no signal sequences, correct orientation.
pCP12  HRP with no signal sequences, wrong orientation.

All results are the mean of duplicate samples.
* = significant level of activity.



HRP Assay

For assaying cell extracts, a substrate mix was prepared as follows:
TMB (3,3',5,5' tetramethyl benzidine (Sigma)) was dissolved to 10 mg/ml in DMSO and 100 mcl of this
solution added to 100 ml of assay buffer (0.1M NaAc in citric acid, pH 6.0) along with 100 mcl H2O2.

A cell extract was prepared by collecting the cells by centrifugation followed by freeze thawing or
sonication. The medium, cell lysates and standards were aliquoted in 96 well microtitre plates as follows:

Sample (mcl)          100    50    25    10     5     1
Assay Buffer (mcl)      0    50    75    90    95    99

Blank samples were set up using 100 mcl of assay buffer alone. 100 mcl of TMB/H2O2 mix was added
to the samples and incubated at RT for 30 mins to 1 hour. The reaction was stopped by the addition of 50
mcl of 2.5M H2SO4 and the colour change read at 450 nm on a plate reader.

Commercially available HRP was used as a standard diluted by a factor of 10~^. 
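
The plate layout used in the assay keeps every well at 100 mcl before the substrate mix is added, so the buffer volume is simply the difference between 100 mcl and the sample volume. The following Python sketch, in which all names are hypothetical and introduced only for illustration, expresses that relationship and the resulting dilution of each sample.

```python
WELL_VOLUME_MCL = 100                       # sample plus assay buffer per well
SAMPLE_VOLUMES_MCL = [100, 50, 25, 10, 5, 1]


def plate_layout(sample_volumes=SAMPLE_VOLUMES_MCL, well_volume=WELL_VOLUME_MCL):
    """Return (sample mcl, assay buffer mcl) pairs for each well."""
    return [(sample, well_volume - sample) for sample in sample_volumes]


def dilution_factor(sample_mcl, well_volume=WELL_VOLUME_MCL):
    """Fold dilution of the sample within the well, before substrate is added."""
    return well_volume / sample_mcl
```

A reading taken from the 25 mcl well, for example, corresponds to a four fold dilution of the extract relative to the 100 mcl well.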

Claims 

1. DNA coding for horseradish peroxidase and including the following sequence: 





CAG TTA ACC CCT ACA TTC TAC GAC AAT AGC TGT CCC
AAC GTG TCC AAC ATC GTT CGC GAC ACA ATC GTC AAC
GAG CTC AGA TCC GAT CCC AGG ATC GCT GCT TCA ATA
TTA CGT CTG CAC TTC CAT GAC TGC TTC GTG AAT GGT
TGC GAC GCT AGC ATA TTA CTG GAC AAC ACC ACC AGT
TTC CGC ACT GAA AAG GAT GCA TTC GGG AAC GCT AAC
AGC GCC AGG GGC TTT CCA GTG ATC GAT CGC ATG AAG
GCT GCC GTT GAG TCA GCA TGC CCA CGA ACA GTC AGT
TGT GCA GAC CTG CTG ACT ATA GCT GCG CAA CAG AGC
GTG ACT CTT GCA GGC GGA CCG TCC TGG AGA GTG CCG
CTC GGT CGA CGT GAC TCC CTA CAG GCA TTC CTA GAT
CTG GCC AAC GCC AAC TTG CCT GCT CCA TTC TTC ACC
CTG CCC CAG CTG AAG GAT AGC TTT AGA AAC GTG GGT
CTG AAT CGC TCG AGT GAC CTT GTG GCT CTG TCC GGA
GGA CAC ACA TTT GGA AAG AAC CAG TGT AGG TTC ATC
ATG GAT AGG CTC TAC AAT TTC AGC AAC ACT GGG TTA
CCT GAC CCC ACG CTG AAC ACT ACG TAT CTC CAG ACA
CTG AGA GGC TTG TGC CCA CTG AAT GGC AAC CTC AGT
GCA CTA GTG GAC TTT GAT CTG CGG ACC CCA ACC ATC
TTC GAT AAC AAG TAC TAT GTG AAT CTA GAG GAG CAG
AAA GGC CTG ATA CAG AGT GAT CAA GAA CTG TTT AGC
AGT CCA AAC GCC ACT GAC ACC ATC CCA CTG GTG AGA
AGT TTT GCT AAC TCT ACT CAA ACC TTC TTT AAC GCC
TTC GTG GAA GCC ATG GAC CGT ATG GGT AAC ATT ACC
CCT CTG ACG GGT ACC CAA GGC CAG ATT CGT CTG AAC
TGC AGA GTG GTC AAC AGC AAC TCT
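
A claimed coding sequence of this kind can be checked against the amino acid sequence of Figure 1 by translating it with the standard genetic code. The sketch below does this for the first two rows of codons only; the helper names are assumptions made for illustration, and the codon table is the standard one rather than anything specific to this patent.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) codon by codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First two rows of the claimed sequence.
FIRST_ROWS = """
CAG TTA ACC CCT ACA TTC TAC GAC AAT AGC TGT CCC
AAC GTG TCC AAC ATC GTT CGC GAC ACA ATC GTC AAC
"""

print(translate(FIRST_ROWS))
```

The printed residues can be compared with the start of the mature protein in Figure 1 (after the bracketed initiator methionine).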











2. DNA as claimed in claim 1, further including, at the 5' end, flanking DNA having one of the following
sequences:

AAGCTTAACCATG; or
AAGCTTCATATG;

or, when the DNA is for expression in mammalian cells, the sequence:

AAGCTTCCACCATGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGGTGACCGGCGTGAACTCC;

and, at the 3' end, flanking DNA having the sequence:

TAATAAGGATCCGAATTC;

or, where the DNA is to be expressed in mammalian cells, the sequence:

CTACTCCATGATATGGTGGAGGTCGTTGACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC.
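
For the flanking sequences of claim 2 to work with the coding sequence of claim 1, the initiator ATG in each 5' flank has to sit in frame with the first codon that follows the flank, and each 3' flank has to present a stop codon in frame with the last codon before it. The sketch below checks both properties; the function names are hypothetical, and the check assumes the first ATG in each 5' flank is the intended start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

FIVE_PRIME_FLANKS = [
    "AAGCTTAACCATG",
    "AAGCTTCATATG",
    "AAGCTTCCACCATGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGGTGACCGGCGTGAACTCC",
]
THREE_PRIME_FLANKS = [
    "TAATAAGGATCCGAATTC",
    "CTACTCCATGATATGGTGGAGGTCGTTGACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC",
]


def start_in_frame(flank):
    """True if the first ATG is a whole number of codons from the flank's end."""
    start = flank.find("ATG")
    return start >= 0 and (len(flank) - start) % 3 == 0


def first_stop_offset(flank):
    """Offset of the first in-frame stop codon, reading from the flank's start."""
    for i in range(0, len(flank) - 2, 3):
        if flank[i:i + 3] in STOP_CODONS:
            return i
    return None


for flank in FIVE_PRIME_FLANKS:
    print(len(flank), start_in_frame(flank))
for flank in THREE_PRIME_FLANKS:
    print(len(flank), first_stop_offset(flank))
```

An offset of 0 means translation stops immediately after the last codon of claim 1; a larger offset means the 3' flank adds further codons before the stop.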

3. A genetic construct comprising DNA as claimed in claim 1 or claim 2.

4. A construct as claimed in claim 3, which is a vector.

5. A process for the preparation of DNA as claimed in claim 1 or claim 2, or a genetic construct in
accordance with claim 3 or claim 4, the process comprising coupling successive nucleotides and/or
ligating appropriate oligomers.

6. A construct as claimed in claim 3 or 4 comprising the sequence defined in claim 1 fused to any other
sequence of DNA so as to result in a sequence capable of encoding a hybrid protein possessing
peroxidase activity.

7. A construct as claimed in claim 6, wherein the other sequence of DNA is a gene encoding streptavidin
or avidin such that the encoded fusion protein possesses both biotin binding and peroxidase activity.

8. A construct as claimed in claim 7, wherein the other sequence is a gene encoding an immunoglobulin-
derived antigen binding function, such that the encoded fusion protein possesses both antigen binding
and horseradish peroxidase activity.

9. A construct as claimed in claim 8, wherein the antigen binding function is an immunoglobulin heavy
chain or light chain or fragments thereof or an engineered monomeric antigenic recognition site.

10. A construct as claimed in claim 4, which is an expression vector that provides for the co-expression of
DNA as claimed in claim 1 or claim 2 and another gene of interest, either as a single fusion product, as
a single polycistronic message or as two separate but linked transcriptional units.

11. A process for the production of a monodisperse horseradish peroxidase C comprising expression of a
genetic construct as claimed in claim 3 or claim 4.

12. A set of oligonucleotides for the preparation of DNA as claimed in claim 1 or claim 2 comprising:

[Alignment diagram; the nucleotide text is not legible in the scan. Oligonucleotides BB279 to BB302
are drawn against a ruler running from 10 to 500 and oligonucleotides BB303 to BB324 against a ruler
running from 490 to 950, with odd-numbered oligonucleotides on the upper strand and even-numbered
oligonucleotides on the lower strand.]
Claims (German text)

1. DNA coding for horseradish peroxidase and comprising the following sequence:



CAG TTA ACC CCT ACA TTC TAC GAC AAT AGC TGT CCC
AAC GTG TCC AAC ATC GTT CGC GAC ACA ATC GTC AAC
GAG CTC AGA TCC GAT CCC AGG ATC GCT GCT TCA ATA
TTA CGT CTG CAC TTC CAT GAC TGC TTC GTG AAT GGT
TGC GAC GCT AGC ATA TTA CTG GAC AAC ACC ACC AGT
TTC CGC ACT GAA AAG GAT GCA TTC GGG AAC GCT AAC
AGC GCC AGG GGC TTT CCA GTG ATC GAT CGC ATG AAG
GCT GCC GTT GAG TCA GCA TGC CCA CGA ACA GTC AGT
TGT GCA GAC CTG CTG ACT ATA GCT GCG CAA CAG AGC
GTG ACT CTT GCA GGC GGA CCG TCC TGG AGA GTG CCG
CTC GGT CGA CGT GAC TCC CTA CAG GCA TTC CTA GAT
CTG GCC AAC GCC AAC TTG CCT GCT CCA TTC TTC ACC
CTG CCC CAG CTG AAG GAT AGC TTT AGA AAC GTG GGT
CTG AAT CGC TCG AGT GAC CTT GTG GCT CTG TCC GGA
GGA CAC ACA TTT GGA AAG AAC CAG TGT AGG TTC ATC
ATG GAT AGG CTC TAC AAT TTC AGC AAC ACT GGG TTA
CCT GAC CCC ACG CTG AAC ACT ACG TAT CTC CAG ACA
CTG AGA GGC TTG TGC CCA CTG AAT GGC AAC CTC AGT
GCA CTA GTG GAC TTT GAT CTG CGG ACC CCA ACC ATC
TTC GAT AAC AAG TAC TAT GTG AAT CTA GAG GAG CAG
AAA GGC CTG ATA CAG AGT GAT CAA GAA CTG TTT AGC
AGT CCA AAC GCC ACT GAC ACC ATC CCA CTG GTG AGA
AGT TTT GCT AAC TCT ACT CAA ACC TTC TTT AAC GCC
TTC GTG GAA GCC ATG GAC CGT ATG GGT AAC ATT ACC
CCT CTG ACG GGT ACC CAA GGC CAG ATT CGT CTG AAC
TGC AGA GTG GTC AAC AGC AAC TCT











2. DNA according to claim 1, additionally comprising, at the 5' end, flanking DNA having one of the
following sequences:

AAGCTTAACCATG; or
AAGCTTCATATG;

or, when the DNA is used for expression in mammalian cells, the sequence:

AAGCTTCCACCATGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGGTGACCGGCGTGAACTCC;

and, at the 3' end, flanking DNA having the sequence:

TAATAAGGATCCGAATTC;

or, when the DNA is to be expressed in mammalian cells, the sequence:

CTACTCCATGATATGGTGGAGGTCGTTGACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC.

3. A genetic construct comprising DNA according to claim 1 or 2.

4. A construct according to claim 3, namely a vector.

5. A process for the preparation of DNA according to claim 1 or 2, or of a genetic construct
according to claim 3 or 4, the process comprising the successive coupling of nucleotides
and/or the ligation of suitable oligomers.

6. A construct according to claim 3 or 4, comprising the sequence as defined in claim 1 fused to
another DNA sequence, so that a sequence is obtained which can encode a hybrid protein
possessing peroxidase activity.

7. A construct according to claim 6, wherein the other DNA sequence is a gene encoding streptavidin or
avidin, so that the encoded fusion protein possesses both biotin binding and peroxidase activity.

8. A construct according to claim 7, wherein the other sequence is a gene encoding an antigen binding
function derived from an immunoglobulin, so that the encoded fusion protein possesses both
antigen binding and horseradish peroxidase activity.

9. A construct according to claim 8, wherein the antigen binding function is an immunoglobulin heavy or
light chain or fragments thereof, or an engineered monomeric antigen recognition site.

10. A construct according to claim 4, namely an expression vector which enables the co-expression of DNA
according to claim 1 or 2 and of another gene of interest, either as a single fusion product, as a
single polycistronic message or as two separate but linked transcription units.

11. A process for the production of a monodisperse horseradish peroxidase C, comprising the expression
of a genetic construct according to claim 3 or 4.

12. A set of oligonucleotides for the preparation of DNA according to claim 1 or 2, comprising:

[Alignment diagram; the nucleotide text is not legible in the scan. Oligonucleotides BB279 to BB302
are drawn against a ruler running from 10 to 500 and oligonucleotides BB303 to BB324 against a ruler
running from 490 onwards, with odd-numbered oligonucleotides on the upper strand and even-numbered
oligonucleotides on the lower strand.]
Claims (French text)

1. DNA coding for horseradish peroxidase and comprising the following sequence:

CAG TTA ACC CCT ACA TTC TAC GAC AAT AGC TGT CCC
AAC GTG TCC AAC ATC GTT CGC GAC ACA ATC GTC AAC 
GAG CTC AGA TCC GAT CCC AGG ATC GCT GCT TCA ATA 
TTA CGT CTG CAC TTC CAT GAC TGC TTC GTG AAT GGT 
TGC GAC GCT AGC ATA TTA CTG GAC AAC ACC ACC AGT 
TTC CGC ACT GAA AAG GAT GCA TTC GGG AAC GCT AAC 
AGC GCC AGG GGC TTT CCA GTG ATC GAT CGC ATG AAG 
GCT GCC GTT GAG TCA GCA TGC CCA CGA ACA GTC AGT 
TGT GCA GAC CTG CTG ACT ATA GCT GCG CAA CAG AGC
GTG ACT CTT GCA GGC GGA CCG TCC TGG AGA GTG CCG
CTC GGT CGA CGT GAC TCC CTA CAG GCA TTC CTA GAT 
CTG GCC AAC GCC AAC TTG CCT GCT CCA TTC TTC ACC 
CTG CCC CAG CTG AAG GAT AGC TTT AGA AAC GTG GGT 
CTG AAT CGC TCG AGT GAC CTT GTG GCT CTG TCC GGA 
GGA CAC ACA TTT GGA AAG AAC CAG TGT AGG TTC ATC 
ATG GAT AGG CTC TAC AAT TTC AGC AAC ACT GGG TTA 
CCT GAC CCC ACG CTG AAC ACT ACG TAT CTC CAG ACA 
CTG AGA GGC TTG TGC CCA CTG AAT GGC AAC CTC AGT 
GCA CTA GTG GAC TTT GAT CTG CGG ACC CCA ACC ATC 
TTC GAT AAC AAG TAC TAT GTG AAT CTA GAG GAG CAG 
AAA GGC CTG ATA CAG AGT GAT CAA GAA CTG TTT AGC 
AGT CCA AAC GCC ACT GAC ACC ATC CCA CTG GTG AGA 
AGT TTT GCT AAC TCT ACT CAA ACC TTC TTT AAC GCC 
TTC GTG GAA GCC ATG GAC CGT ATG GGT AAC ATT ACC 
CCT CTG ACG GGT ACC CAA GGC CAG ATT CGT CTG AAC 
TGC AGA GTG GTC AAC AGC AAC TCT 

2. DNA as defined in claim 1, further comprising, at the 5' end, adjacent DNA having
one of the following sequences:

AAGCTTAACCATG; or
AAGCTTCATATG;

or, when the DNA is intended for expression in mammalian cells, the sequence:

AAGCTTCCACCATGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGGTGACCGGCGTGAACTCC;

and, at the 3' end, adjacent DNA having the sequence:

TAATAAGGATCCGAATTC;

or, if the DNA is intended to be expressed in mammalian cells, the sequence:

CTACTCCATGATATGGTGGAGGTCGTTGACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC.

3. A genetic construct comprising the DNA as defined in claim 1 or claim 2.

4. A construct as defined in claim 3, which is a vector.

5. A process for the preparation of the DNA as defined in claim 1 or claim 2, or of a
genetic construct as defined in claim 3 or claim 4, the process
comprising the coupling of successive nucleotides and/or the ligation of appropriate oligomers.

6. A construct according to claim 3 or 4, comprising the sequence as defined in
claim 1 fused to any other DNA sequence, so as to result in a
sequence capable of coding for a hybrid protein possessing peroxidase activity.

7. A construct according to claim 6, wherein the other DNA sequence is a gene coding
for streptavidin or avidin, such that the encoded fusion protein possesses both
biotin binding activity and peroxidase activity.

8. A construct according to claim 7, wherein the other sequence is a gene coding for
an antigen binding function derived from an immunoglobulin, such that the encoded
fusion protein possesses both antigen binding activity and horseradish peroxidase activity.

9. A construct according to claim 8, wherein the antigen binding function is an
immunoglobulin heavy chain or light chain, or fragments thereof, or a monomeric
antigen recognition site obtained by genetic engineering.

10. A construct according to claim 4, which is an expression vector that provides for the
co-expression of the DNA as defined in claim 1 or claim 2 and of another gene
of interest, either in the form of a single fusion product, or in the form of a single
polycistronic message, or in the form of two separate but linked transcription units.

11. A process for the production of a monodisperse horseradish peroxidase C, comprising the expression of a
genetic construct as defined in claim 3 or claim 4.

12. A set of oligonucleotides for the preparation of the DNA as defined in claim 1 or
claim 2, comprising:

[Alignment diagram; the nucleotide text is not legible in the scan. Oligonucleotides BB279 to BB302
are drawn against a ruler running from 10 to 500 and oligonucleotides BB303 to BB324 against a ruler
running from 490 to 950, with odd-numbered oligonucleotides on the upper strand and even-numbered
oligonucleotides on the lower strand.]
FIGURE 1

AMINO ACID SEQUENCE OF HORSERADISH PEROXIDASE (INCLUDING

(M)QLTPTFYDNSCPNVSNIVR     20
DTIVNELRSDPRIAASILRL       40
HFHDCFVNGCDASILLDNTT       60
SFRTEKDAFGNANSARGFPV       80
IDRMKAAVESACPRTVSCAD      100
LLTIAAQQSVTLAGGPSWRV      120
PLGRRDSLQAFLDLANANLP      140
APFFTLPQLKDSFRNVGLNR      160
SSDLVALSGGHTFGKNQCRF      180
IMDRLYNFSNTGLPDPTLNT      200
TYLQTLRGLCPLNGNLSALV      220
DFDLRTPTIFDNKYYVNLEE      240
QKGLIQSDQELFSSPNATDT      260
IPLVRSFANSTQTFFNAFVE      280
AMDRMGNITPLTGTQGQIRL      300
NCRVVNSNS                 309
FIGURE 2a

SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE

[Both nucleotide strands are illegible in the scan; the translation and the restriction sites marked
on each 60-nucleotide row are listed below.]

POSITION   TRANSLATION             SITES

1-60       MQLTPTFYDNSCPNVSN       HinDIII, HpaI
61-120     IVRDTIVNELRSDPRIAASI    NruI, SacI, SspI
121-180    LRLHFHDCFVNGCDASILLD    NheI
181-240    NTTSFRTEKDAFGNANSARG    NsiI
241-300    FPVIDRMKAAVESACPRTVS    ClaI/PvuI, SphI
301-360    CADLLTIAAQQSVTLAGGPS    BspMI, FspI, RsrII
361-420    WRVPLGRRDSLQAFLDLANA    SalI, BglII/BalI
421-480    NLPAPFFTLPQLKDSFRNVG    PvuII
481-540    LNRSSDLVALSGGHTFGKNQ    XhoI, BspMII
541-600    CRFIMDRLYNFSNTGLPDPT    BstEII
601-660    LNTTYLQTLRGLCPLNGNLS    SnaBI, PflMI
661-720    ALVDFDLRTPTIFDNKYYVN    ApaLI/SpeI, ScaI
721-780    LEEQKGLIQSDQELFSSPNA    XbaI, StuI, BclI
781-840    TDTIPLVRSFANSTQTFFNA    BstXI
841-900    FVEAMDRMGNITPLTGTQGQ    NcoI, KpnI
901-955    IRLNCRVVNSNS**          PstI, BamHI, EcoRI
FIGURE 2b

SUMMARY OF USEFUL RESTRICTION SITES.

ENZYME     SEQUENCE        POSITION

HinDIII    AAGCTT             1
HpaI       GTTAAC            16
SacI       GAGCTC            86
SspI       AATATT           118
NheI       GCTAGC           164
NsiI       ATGCAT           210
ClaI       ATCGAT           251
PvuI       CGATCG           253
SphI       GCATGC           281
BspMI      ACCTGC           309
FspI       TGCGCA           325
RsrII      CGGACCG          352
SalI       GTCGAC           378
BglII      AGATCT           406
BalI       TGGCCA           411
PvuII      CAGCTG           452
XhoI       CTCGAG           490
BspMII     TCCGGA           512
BstEII     GGTTACC          585
SnaBI      TACGTA           610
PflMI      CCACTGAATGG      641
ApaLI      GTGCAC           660
SpeI       ACTAGT           664
ScaI       AGTACT           708
XbaI       TCTAGA           721
StuI       AGGCCT           736
BclI       TGATCA           751
BstXI      CCATCCCACTGG     789
NcoI       CCATGG           852
KpnI       GGTACC           887
PstI       CTGCAG           913
BamHI      GGATCC           944
EcoRI      GAATTC           950
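
Positions in a table of this kind are 1-based offsets of each recognition sequence on the upper strand. The sketch below shows how such positions can be recomputed by plain string search; it uses only the 5' flank AAGCTTAACCATG followed by the first codons of the coding sequence, and the function and variable names are illustrative assumptions.

```python
# Recognition sequences for three of the enzymes listed in the table.
SITES = {
    "HinDIII": "AAGCTT",
    "HpaI": "GTTAAC",
    "SacI": "GAGCTC",
}

# 5' flank plus the first codons of the synthetic gene (whitespace is removed).
GENE_START = "".join("""
AAGCTTAACCATG
CAG TTA ACC CCT ACA TTC TAC GAC AAT AGC TGT CCC
AAC GTG TCC AAC ATC GTT CGC GAC ACA ATC GTC AAC
GAG CTC AGA TCC
""".split())


def find_sites(sequence, sites):
    """Return {enzyme: [1-based start positions]} for exact matches."""
    hits = {}
    for enzyme, motif in sites.items():
        hits[enzyme] = [
            i + 1
            for i in range(len(sequence) - len(motif) + 1)
            if sequence[i:i + len(motif)] == motif
        ]
    return hits


for enzyme, positions in find_sites(GENE_START, SITES).items():
    print(enzyme, positions)
```

Exact matching is enough here because these three recognition sequences contain no degenerate bases; enzymes such as PflMI or BstXI would need pattern matching instead.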
FIGURE 2c

SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE 5' END HALF

[Both nucleotide strands are illegible in the scan; the translation and the restriction sites marked
on each row are listed below.]

POSITION   TRANSLATION             SITES

1-60       MQLTPTFYDNSCPNVSN       HinDIII, HpaI
61-120     IVRDTIVNELRSDPRIAASI    NruI, SacI, SspI
121-180    LRLHFHDCFVNGCDASILLD    NheI
181-240    NTTSFRTEKDAFGNANSARG    NsiI
241-300    FPVIDRMKAAVESACPRTVS    ClaI/PvuI, SphI
301-360    CADLLTIAAQQSVTLAGGPS    BspMI, FspI, RsrII
361-420    WRVPLGRRDSLQAFLDLANA    SalI, BglII/BalI
421-480    NLPAPFFTLPQLKDSFRNVG    PvuII
481-500    LNRS                    XhoI, EcoRI
FIGURE 2d

SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE 3' END HALF.

[Both nucleotide strands are illegible in the scan; the translation and the restriction sites marked
on each row are listed below.]

POSITION   TRANSLATION             SITES

490-540    SSDLVALSGGHTFGKNQ       HinDIII, XhoI, BspMII
541-600    CRFIMDRLYNFSNTGLPDPT    BstEII
601-660    LNTTYLQTLRGLCPLNGNLS    SnaBI, PflMI
661-720    ALVDFDLRTPTIFDNKYYVN    ApaLI/SpeI, ScaI
721-780    LEEQKGLIQSDQELFSSPNA    XbaI, StuI, BclI
781-840    TDTIPLVRSFANSTQTFFNA    BstXI
841-900    FVEAMDRMGNITPLTGTQGQ    NcoI, KpnI
901-955    IRLNCRVVNSNS**          PstI, BamHI, EcoRI
FIGURE 3a

DESIGN OF OLIGOMERS FOR SYNTHETIC HORSERADISH PEROXIDASE GENE 5' HALF.

[Alignment diagram against a ruler running from 10 to 500; most of the nucleotide text is not
legible in the scan.]

Upper-strand oligomers, in order: BB279, BB281, BB283, BB285, BB287, BB289, BB291, BB293, BB295,
BB297, BB299, BB301.

Lower-strand oligomers, in order: BB280, BB282, BB284, BB286, BB288, BB290, BB292, BB294, BB296,
BB298, BB300, BB302.

The final segment, at the 490 and 500 ruler marks, reads:

TCTGAATCGCTCGAGG 3'
||||||||||||||||
AGACTTAGCGAGCTCCTTAA 5'
FIGURE 3b

DESIGN OF OLIGOMERS FOR SYNTHETIC HORSERADISH PEROXIDASE GENE 3' HALF.

[Alignment diagram against a ruler running from 490 to 950; the nucleotide text is not legible in
the scan.]

Upper-strand oligomers, in order: BB303, BB305, BB307, BB309, BB311, BB313, BB315, BB317, BB319,
BB321, BB323.

Lower-strand oligomers, in order: BB304, BB306, BB308, BB310, BB312, BB314, BB316, BB318, BB320,
BB322, BB324.
FIGURE 4a

SUMMARY OF ASSEMBLY PROCEDURE, 5' HALF.

a) Kinased oligomers annealed in pairs and mixed in two groups
(A & B).

    BB279  BB281  BB283  BB285  BB287  BB289
    AGCTT    p      p      p      p      p
        A    p      p      p      p      p
    BB280  BB282  BB284  BB286  BB288  BB290

    BB291  BB293  BB295  BB297  BB299  BB301
      p      p      p      p      p      p     G
      p      p      p      p      p        CTTAA
    BB292  BB294  BB296  BB298  BB300  BB302

b) Oligomers ligated together in two groups (BB279 and BB302 not
kinased to avoid multimerisation).

    fragment A
    AGCTT---------------
        A---------------ACTAGCT

    fragment B
    p
    TGATCGA-----XhoI-----G
           -----XhoI-----CTTAA

c) The ligations were checked for the presence of fragment A & B on
2% agarose gels, then the ligation reactions were mixed and the
reaction allowed to continue to give the final product.

    AGCTT-----TGATCGA-----XhoI-----G
        A-----ACTAGCT-----XhoI-----CTTAA

d) The HRP gene fragment was isolated on a 2% LGT agarose gel and
cloned into EcoRI/HinDIII cut pUC18.

                  HRP 5' half          p
    vvvvva      AGCTT-----XhoI-----G      aattcvvvvv
    vvvvvttcga      A-----XhoI-----CTTAA      gvvvvv
             p

    HinDIII       HRP 5' half       EcoRI
    vvvvvaAGCTT-----XhoI-----Gaattcvvvvv
    vvvvvttcgaA-----XhoI-----CTTAAgvvvvv

v = vector sequence
p = 5' phosphates
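
Step b) leaves the two terminal oligomers unphosphorylated to avoid multimerisation. One way to see why this matters: the HinDIII and EcoRI overhangs at the outer ends are self-complementary, so two copies of the same end could anneal to each other, whereas the internal TGATCGA junction is not self-complementary and joins fragments A and B in one orientation only. The sketch below checks this property; the function names are hypothetical and the overhang strings are read from the diagrams above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_self_complementary(overhang):
    """True if two copies of this single-stranded overhang can pair with each other."""
    return reverse_complement(overhang) == overhang.upper()


# Single-stranded overhangs: outer ends of the assembled half-gene and the A/B junction.
OVERHANGS = {
    "HinDIII end": "AGCT",
    "EcoRI end": "AATT",
    "A/B junction": "TGATCGA",
}

for name, overhang in OVERHANGS.items():
    print(name, overhang, is_self_complementary(overhang))
```

Because DNA ligase needs a 5' phosphate to seal a nick, omitting the phosphate on the terminal oligomers is a common way to stop self-complementary ends from being joined end to end.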
FIGURE 4b

SUMMARY OF ASSEMBLY PROCEDURE, 3' HALF.

a) Kinased oligomers annealed in pairs and mixed in two groups (A & B).

    Upper strand:  AGCTT- BB303 BB305 BB307 BB309 BB311 BB313 BB315 BB317 BB319 BB321 BB323 -G
    Lower strand:      A- BB304 BB306 BB308 BB310 BB312 BB314 BB316 BB318 BB320 BB322 BB324 -CTTAA

b) Oligomers ligated together in two groups (BB303 and BB324 not
kinased to avoid multimerisation).

    fragment C
    AGCTT--XhoI----------
        A--XhoI----------AGATCTC

    fragment D
    p
    -----------------G
    -----------------CTTAA

c) The ligations were checked for the presence of fragment C & D on
2% agarose gels, then the ligation reactions were mixed and the
reaction allowed to continue to give the final product.

    AGCTT--XhoI----------------------G
        A--XhoI-----AGATCTC----------CTTAA

d) The HRP gene fragment was isolated on a 2% LGT agarose gel and
cloned into EcoRI/HinDIII cut pUC18.

                  HRP 3' half
    vvvvva      AGCTT--XhoI--------G      aattcvvvvv
    vvvvvttcga      A--XhoI--------CTTAA      gvvvvv
             p

    HinDIII       HRP 3' half       EcoRI
    vvvvvaAGCTT--XhoI--------Gaattcvvvvv
    vvvvvttcgaA--XhoI--------CTTAAgvvvvv

v = vector sequence
p = 5' phosphates






EP 0 382 762 B1 



FIGURE 4c

SUMMARY OF FINAL ASSEMBLY PROCEDURE.

a) 5' and 3' clones of HRP cloned in pUC18 were digested with XhoI
and EcoRI. Relevant fragments from each digest were isolated
from a 0.8% LGT agarose gel.

    HinDIII     HRP 5' HALF     XhoI    EcoRI
    vvvvvAAGCTT-----------------CTCGAG--GAATTCvvvvv
    vvvvvTTCGAA-----------------GAGCTC--CTTAAGvvvvv

    HinDIII  XhoI     HRP 3' HALF       EcoRI
    vvvvvAAGCTT--CTCGAG-----------------GAATTCvvvvv
    vvvvvTTCGAA--GAGCTC-----------------CTTAAGvvvvv

b) XhoI/EcoRI fragment carrying 3' half of HRP ligated into
XhoI/EcoRI cut HRP 5' half clone.

         HRP 3' HALF
    TCGAG-----------------G
        C-----------------CTTAA

         HRP 5' HALF
    vvvvvAAGCTT-----------------C          AATTCvvvvv
    vvvvvTTCGAA-----------------GAGCT          Gvvvvv

c) Completed gene cloned in pUC18.

         HRP 5' HALF             HRP 3' HALF
    vvvvvAAGCTT-----------------CTCGAG-----------------GAATTCvvvvv
    vvvvvTTCGAA-----------------GAGCTC-----------------CTTAAGvvvvv

v = vector sequence
Figure 5. The HRP Expression Plasmid pSDlS.
Figure 6. Synthetic HRP Gene Modified for Expression in Mammalian
Cells.



Mkcswvifflmavvtgv 
AAGCTTCCACCATGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGGTGACCGGCG 
10 20 30 40 50 60

nsQLTPTFYDNSCPNVSNIV
TGAACTCCCAGTTAACCCCTACATTCTACGACAATAGCTGTCCCAACGTGTCCAACATCG
70 HpaI 80 90 100 110 120

RDTIVNELRSDPRIAASILR
TTCGCGACACAATCGTCAACGAGCTCAGATCCGATCCCAGGATCGCTGCTTCAATATTAC 
130 140 150 160 170 180 

LHFHDCFVNGCDASILLDNT 
GTCTGCACTTCCATGACTGCTTCGTGAATGGTTGCGACGCTAGCATATTACTGGACAACA 
190 200 210 220 230 240 

TSFRTEKDAFGNANSARGFP 
CCACCAGTTTCCGCACTGAAAAGGATGCATTCGGGAACGCTAACAGCGCCAGGGGCTTTC 
250 260 270 280 290 300 

VIDRMKAAVESACPRTVSCA 
CAGTGATCGATCGCATGAAGGCTGCCGTTGAGTCAGCATGCCCACGAACAGTCAGTTGTG
310 320 330 340 350 360 

DLLTIAAQQSVTLAGGPSWR 
CAGACCTGCTGACTATAGCTGCGCAACAGAGCGTGACTCTTGCAGGCGGACCGTCCTGGA 
370 380 390 400 410 420 

VPLGRRDSLQAFLDLANANL 
GAGTGCCGCTCGGTCGACGTGACTCCCTACAGGCATTCCTAGATCTGGCCAACGCCAACT 
430 440 450 460 470 480 

PAPFFTLPQLKDSFRNVGLN 
TGCCTGCTCCATTCTTCACCCTGCCCCAGCTGAAGGATAGCTTTAGAAACGTGGGTCTGA 
490 500 510 520 530 540 

RSSDLVALSGGHTFGKNQCR 
ATCGCTCGAGTGACCTTGTGGCTCTGTCCGGAGGACACACATTTGGAAAGAACCAGTGTA 
550 560 570 580 590 600 

FIMDRLYNFSNTGLPDPTLN 
GGTTCATCATGGATAGGCTCTACAATTTCAGCAACACTGGGTTACCTGACCCCACGCTGA 
610 620 630 640 650 660 

TTYLQTLRGLCPLNGNLSAL 
ACACTACGTATCTCCAGACACTGAGAGGCTTGTGCCCACTGAATGGCAACCTCAGTGCAC 
670 680 690 700 710 720 
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VDFDLRTPTIFDNKYYVNLE 
TAGTGGACTTTGATCTGCGGACCCCAACCATCTTCGATAACAAGTACTATGTGAATCTAG 
730 740 750 760 770 780 

EQKGLIQSDQELFSSPNATD
AGGAGCAGAAAGGCCTGATACAGAGTGATCAAGAACTGTTTAGCAGTCCAAACGCCACTG 
790 800 810 820 830 840 

TIPLVRSFANSTQTFFNAFV 
ACACCATCCCACTGGTGAGAAGTTTTGCTAACTCTACTCAAACCTTCTTTAACGCCTTCG 
850 860 870 880 890 900 

EAMDRMGNITPLTGTQGQIR
TGGAAGCCATGGACCGTATGGGTAACATTACCCCTCTGACGGGTACCCAAGGCCAGATTC 
910 920 930 940 950 960 

LNCRVVNSNSllhdmvevvd
GTCTGAACTGCAGAGTGGTCAACAGCAACTCTCTACTCCATGATATGGTGGAGGTCGTTG
PstI 970 980 990 1000 1010 1020

fvssm**
ACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC
1030 BamHI EcoRI

KEY 

Underlined sequences indicate linkers used to adapt synthetic gene. 
Lower case residues indicate N and C terminal pre & pro sequences. 
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